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Abstract 

This paper shows how the steady-state availability and failure frequency can be calculated in 
a single pass for very large systems, when the availability is expressed as a product of matrices. 
We apply the general procedure to fc-out-of-n:G and linear consecutive /c-out-of-n:F systems, and 
to a simple ladder network in which each edge and node may fail. We also give the associated 
generating functions when the components have identical availabilities and failure rates. For large 
systems, the failure rate of the whole system is asymptotically proportional to its size. 

This paves the way to ready-to-use formulae for various architectures, as well as proof that the 
differential operator approach to failure frequency calculations is very useful and straightforward. 

Index Terms 

network availability, failure frequency, failure rate, /c-out-of-n systems, generating function 

Acronym 

GVI grouped variable inversion (method) 
SVI single variable inversion (method) 
IE inclusion-exclusion (principle) 

OBDD Ordered Binary Decision Diagram 
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2. 

Notation 

Pi, qi [success, failure] probability of component % 

(qt = 1 - pd 
p, q implies Pi = p, qi = q (for edges). 
p identical availability of nodes (when p ^ p). 
Aj, pi [failure, repair] rate of component i 
A, p common [failure, repair] rate of components 
A steady-state availability of the system 
U steady-state unavailability of the system 
V mean failure frequency of the system 
A mean failure rate of the system (V = A A) 




Q(z) generating function for the availability 
Q{z) generating function for the failure frequency 
A kjJl availability of a fc-out-of-n:G system 

I. Introduction 

Steady-state system availability and failure frequency are important performance indices 
of a repairable system [1], [2], [3], [4], from which other key parameters such as the mean 
time between failures, average failure rate, Birnbaum importance, etc. may be deduced. In 
the steady-state regime, the frequency of system failure was first calculated by a cut-set [5] 
or a tie-set approach [6] in the case of statistically independent failures, which will also be 
considered here. These approaches are based on the inclusion-exclusion (IE) principle, where 
the failure or repair rates (more generally, the inverses of the mean down or up times), are 
adequately given for each term of the relevant expansion. 

When all the terms of its IE expansion are kept, the exact availability is obtained as a 
function of each component availability. Several papers have provided a few simple recipes, 
describing how the system failure frequency and the failure rate can then be derived [7], [8], 
[9]. Recent refinements have been proposed when availability expressions are obtained from 
various instances (SVI, GVI) of sum-of-disjoint-products algorithms [10]. All these formal 
calculations boil down to a simple fact: the failure frequency may be derived from the 
availability through the application of a linear differential operator [11], [12]. This requires 
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knowledge of the exact availability, which is hard to come by except for trivially small 
networks, and may have hindered the use of this method. 

Unsurprisingly, several algorithms have been put forward, in which availability and failure 
frequency are computed side by side in a common procedure: triangle-star transformation 
[13], OBDD calculations [14], and another instance of differential operator calculations [15]. 

In this paper, we want to promote the differential operator method for the calculation 
of the failure frequency by showing it gives the exact result for numerous, widely used 
configurations, with an arbitrary large number of components. We take advantage of recent 
results establishing that the availability of recursive networks may be expressed as a product 
of transfer matrices that take each edge and node availabilities exactly into account [16], 



Our paper is organized as follows. In Section HH, we show how the failure frequency 
of a system may generally be deduced from the steady-state availability when the latter is 
expressed by a product of transfer matrices. We first apply this method in Section [Till which 
is devoted to k-out-of-n systems (either A;-out-of-n:G or linear consecutive A;-out-of-n:F ones) 
with distinct components. Section JV] provides a generic example for the two-terminal failure 
frequency of a simple ladder network, which has been solved recently for arbitrary edge and 
node availabilities [16]; the same procedure could easily be used for more complex networks 
and their all-terminal reliability too [17], [18]. In each configuration, we pay attention to the 
case of identical components, for which the common availability is p (for edges) and p (for 
nodes). For large systems, we show that the asymptotic failure rate has a linear dependence 
with size, and is given by derivatives of the largest eigenvalue of the unique transfer matrix 
with respect to p and p. We conclude by a brief outlook. 



In many systems, as will be explicitly shown in the following sections, the availability A 
(or the unavailability U) is given by an expression of the form 



where M k (1 < k < n) is a transfer matrix, the elements of which are multilinear polynomials 
of individual component availabilities, and where wl and v# are two vectors in which these 
availabilities do not appear. The mean failure frequency V is obtained from [11], [12] 



[17], [18]. 



II. General procedure 



A = v L M n ■ ■ ■ 



M lVR 



(1) 




(2) 
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In order to avoid unnecessarily heavy notation, we call M' k the matrix obtained by applying 

Q 

the linear differential operator > \pi —— to M k . Therefore, 

" dpi 

V = vl M' n M n _i ... MiVfl 
+v L M n M' n _ x ■ ■ ■ M x v R 
+ ■■■ 

+v L M n M n _! • • • M[ v R . (3) 

Since M fc 's elements are at most linear functions of each pi, the derivation of M' k is straight- 
forward. For instance, a matrix element p x + p 2 Ps — p\ P2 P3 in M k would give rise to 
AiPi + (A 2 + A3)p2P3-(Ai + A 2 + A3)pip 2 p 3 ; the recipes given in [7], [8], [11] fully apply. 

Both availability and failure frequency may be obtained in a single pass in the following 
way. Let us initialize the procedure by setting 

Ai = MiVfl, (4) 

Vi = M[v R . (5) 

The recursion equations are 

A k = M k A k -i, (6) 

V k = M k V k -! + M' k A k . u (7) 

from which we deduce the final results 

A = Vl An , (8) 
V = v L V„. (9) 
We can now turn to a few 'real-life' applications. 

III. /c-OUT-OF-n SYSTEMS 

/c-out-of-n systems are widely used, in various configurations; they have therefore con- 
tributed to a huge body of literature (see [4], [19], [20] and references therein). We start 
our discussion with these systems because each transfer matrix actually refers to a single 
equipment only. 
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A. k-out-of-n:G systems 

We first consider the simple £>out-of-n:G system, where each component has an availability 
Pi (1 < i < n). To operate as a whole, the system needs at least k elements to function. Its 
availability A k ^ n may be written as (see [4], p. 244) 

/ 1 \ 
1 



A 



k.n 



(1,0, 



0) fc A n A n _i Aj 



(10) 



V 1 / 



with 



A,; 



f q% p% 

o qi 



\ 





Pi 



... \ 



■•. 

qi Pi 

% / 



(11) 



kxk 



/-A, 



A' 



Pi 



\ 



We have reduced the size of the matrix to a k x k one, instead of the original [k + 1) x {k + 1 
because of the nature of = (1, 0, . . . , 0)fc and vr in eq. (flOl) . 
The 'derivative' of A, is 

XiPi 
-XiPi XiPi 


: -XiP-, 
••• 








V o 



(12) 







Kpi 

so that the computation of the failure frequency following the method given in section HI is 
straightforward (care should of course be taken of the minus sign in eq. (fTOl)). 

Let us revisit Example 7.2 of [4] (see p. 245) for the 5-out-of-8:G system with pi = 0.90, 
0.89, 0.83. Assuming a unique repair rate for all components, namely /i, the failure rates 
Xi are such that Xipi = fi (1 — Pi). From the procedure detailed in Section UH we deduce 
A ^ = 625000000000 « 0.98548045 and a failure frequency V 5fi = /x « 0.051283^. 

The failure rate A 5i8 = I7 5 g/^5 § is then equal to 0.0520382 



When all components are identical (pj = p and Aj = A), only one transfer matrix appears. 
Admittedly, A^ n is so simple that a matrix formulation is hardly necessary. Nonetheless, we 
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can give a compact expression for the generating function Gk{ z ) — ^-k,n z n (the derivation 

n=0 

is given in the appendix): 

b h 
P Z 

g k {k-out-oi-n:G; z) = r— r — r . (13) 

(1 - z) (1 - (1 -p) z) k 

Since the generating function is a formal power-series expansion, we can apply the linear 

oo 

differential operator Xp to eq. (fl"3l so that £?fc(z) = ^^z^fc.n -2 n is easily found to be 



n=0 

k Jt 



C/(/c-out-of-n:G; z) 



A kp z 



(l-(l-p) z) 



fe+i 



(14) 



n 



which is another formulation of the well-known result u k>n = X k | ] p k (1— p) n k (eq. (7.10) 
of [4], p. 234). 

B. Linear consecutive k-out-of-n:F systems 

These systems have been studied in many papers [19], [20] and a recent textbook [4]. The 
reliability A kjn — the probability of operation of a system of n components, which fails if 
at least k consecutive elements fail — of such a system is given by (see also eq. (9.48) of 
[4], p. 344) 

/l\ 
1 



A 



k.n 



1,0, ...,0) fe A n A n _iAi 



(15) 



V 1 / 



with 



A; 



' Pi 


Qi 





... o 


\ 


Pi 





Qi 


o ; 











■-. 




Pi 







qi 




V Pi 








... o 


/ 



(16) 



kxk 

Here again, we have reduced the size of the matrix and the vectors with respect to their 
original formulation. Consequently, 

/ XiPi -XiPi ••• \ 
Xipt -XiPi : 

; ; o ■•. 

XiPi •■■ -XiPi 

\ x iPi ••• A , 



A' 



(17) 
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leading once again to a straightforward calculation of the failure frequency. 

A numerical application may be found in the Lin/Con/4/1 1:F model, as in Example 9.6 of 
[4], where the p^s range from 0.7 to 0.9 by steps of 0.02. Assuming again that the repair 
rate for each equipment is \i, we get A 4jll = 30517573125000 ~ 0.98649329 and a failure 
frequency z7 4> n = 30517578125000 ^ ~ 0-050953 fi. The corresponding failure rate is then equal 
to 0.0516505 fi. 

For the sake of completeness, we give the generating function for identical components is 
(see eq. (2.2) of [21], [22]) 

£ fc (Lin/Con/A;M:F; z) = 1 - ^ ~ ^ * — . (18) 

v ' ' 1 - z + p(l -p) k z k+1 

Use of eq. (fl"8l to obtain the failure frequency generating function is straightforward, and 
will not be repeated here. 

IV. Simple ladder 

We consider in this section the two-terminal availability of a simple ladder network, 
displayed in Fig. [U where successive nodes are labelled Si or Tj, and where the larger 
black dots mark the source s and terminal t. This network is a simplified description of 
a standard architecture for long-hail communication networks: it consists in primary and 
backup paths, plus additional connections between transit nodes enabling the so-called "local 
protection" policy by bypassing faulty intermediate nodes or edges. Such an architecture of 
"absolutely reliable nodes and unreliable edges," with up to 25 edges, was chosen as Example 
5 in [23] for a comparison of different "sum of disjoint products" minimizing algorithms, or 
by Rauzy [24] as well as Kuo and collaborators in OBDD test calculations [25], [26], [27]. 
We showed [16] that the two-terminal availability has a beautiful algebraic structure [28], 
since its exact expression is given by a product of 3 x 3 transfer matrices (see eqs. (|T9l42Tb 
below). Consequently, it can also be determined for a network of arbitrary size. 

Using the notation 1Zs n (resp. TZr n ) for the two-terminal availability between S and S n 
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o 




Fig. 1. Different source-terminal connections for a simple ladder. Edges and nodes are indexed by their availabilities: 

(ai,bi,Ci) and (5i,T<). 



(resp. T n ), we find that [16], [18] 



;i 0)M n •■• M 



^7 



(0 1 0) M n - - • M 





v / 
M 



V / 



(19) 



(20) 



The transfer matrix M n is given by 



M„ 



Q"n bn S n T n 



b-n Cn S n T n 



Q"n bn C n S n T n 
Q"n bn Cn S n T n 



\ 



~ci n b n S n T n 



^71 Syi T n 0"n ( 1 2 t)fx) Cn Sri T 7 



(21) 



) c n 5^ T n J 



For n = 0, we must set a = 1, and may choose c = because it does not change the final 
result. It is worth noting that all five availabilities of the n th "cell" or building block of the 
network appear in a single transfer matrix M n , which is not sparse, contrary to the matrices 
of Section [TIT1 

Equations ([7914211) apply to the most general ladder in terms of individual availabilities. If 
an edge or a node is missing, its reliability pi should be set to zero, and its failure rate may 
be considered arbitrary, because it will not alter the final result. Similarly, if a given edge 
or node is perfect, its reliability should be equal to one; its failure rate Aj should then, of 
course, be set to zero. 
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The associated matrix ML is 



mii 777.1 9 777 



12 "13 



777 i mL ml. 



22 
/ 



23 
/ 



m 31 m 32 m 33 



(22) 



with 



m' u 


— (^«n + ^S„) On , 






m' 12 


= (A&„ + A Cn + X Sn + A T J b n c n 


Sn T n 




m' 13 


= (A a „ + A bn + \ Cn + \ Sn + \r n 


) a n b n 


C n S n T n 


m' 21 


= (K n + \ n + A 5n + A T J On b n 


q rp 


1 


m'22 


— (A Cn + At„) c n T n , 






m> 23 


= m 'n, 






m' 31 


= -m'21 » 






™32 


= -"i' 12 , 






™33 


= (A a „ + A Cn + A 5n + A T J a n c n 


q rp 

u n - 1 n 


~ 2 m 'l3 



When ai = bi = Ci = p and Si = T { = p the three eigenvalues Co an d C± °f the transfer 
matrix are [16] 



Co = pp(l-pp), 

C± = y (i + 2p(i-p)p±V]b) , 

with £> = 1 + 4p 2 p — 8p 3 p 2 + Ap 4 p 2 . The two-terminal availabilities are [16] 

I / /-n+l _ Ati+1 



(23) 
(24) 



{l-2p + pp)p s p 



An Ar 

3 „3 *»+ S- 



K Sn (P, P) 



C+-C- 

/n+l _ /n+l 

2/ , k +Q +1 +pp(^ + pp) \ + _ c ~ 



(25) 



'1 - 2p + pp)p s p' 



An / t r. 

3 „3 *»+ S- 



(26) 



C+-C- 

These expressions are identical except for the ± sign in front of the Q +1 term. Assuming 
that the common link failure rate is A while that for the nodes is £, the failure frequency for 
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the Sn — ► T„ connection is 



v 



Xp ^ + £/?■ 



(27) 



dp ~" ' <9p 

a similar expression applies to TZs n (p,p). When n is large, both availabilities are actually of 
the form a + because the modulus of £+ is larger than that of the remaining eigenvalues 
for < p < 1 [16]. When nodes are perfect, we have therefore in this limit 



v 



Xp -^Q + na + — Q 



so that the failure rate is 



A « A 



dp 

c?lna + 
<91np 



+ n 



dp 
<91np 



(28) 



(29) 



-1 + 4p - 6p 2 + 4p 3 + (3 - 4p) VI + 4p 2 (l -p) 2 

2(1 -p) v/l + 4p 2 (l-p) 2 
A-5p + 8p 2 - 20 p 3 + 16 p 4 - 4p 5 - (4 - 7p + 4p 2 - 2p 3 ) 



/ l + 4p 2 (l - Pi 



with 

ainC+ 

<91na + 

dlnp ~ 2(1 -p) [1 + 4p 2 (l -pY 

The variations with p of <91nC + /<91np and <91na + /<9mp are displayed in Figs. [2] and |3] 

Since \C,+ \ < 1 for p < 1, the contribution of <91na + /<91np will prevail, and A will have 

a linear dependence with n in the large network limit (this is a general property when the 

eigenvalue of highest modulus is different from unity). 



(30) 




Fig. 2. Variation of dln(^+/d\np with p for a simple ladder with perfect nodes (eq. i30\ ). The maximum, reached for 
p w 0.251641, is about 1.13827. 



For very reliable components, eqs. (12^11311) simplify. It is easy to show by a series expansion 
in the link unavailability q that the global failure rate is given (to first order) by A — > 
(2 n + 4) A q; this result could also have been obtained by visual inspection and enumeration 
of the minimal cuts. 
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Fig. 3. Variation of d\na+/d\np with p for a simple ladder with perfect nodes (eq. (31}), The maximum, reached for 
V ~ 0.709902, is about 0.458825. 



We have shown that the linear differential method for computing the failure frequency is a 
very simple and useful one for A;-out-of-n systems as well as the two-terminal availability for 
recursive networks (this should hold for the all-terminal availability, too [18]). Its application 
is not limited to the case of extremely reliable components. Even though we restricted our 
discussion to expressions dealing with availabilities, a similar treatment could be performed 
for expressions where unavailabilities are the input data (see eq. ©). For more complex 
networks, the size of the transfer matrix increases (for instance, it is a 13 x 13 one for the 
'street 3 x rC of [26]) but the calculations remain straightforward. Finally, the expressions 
given for steady-state availabilities can also be used for time-dependent systems provided 
that failures and reparations are still statistically independent events, because the expressions 
are formally identical (the availabilities of components must be replaced by the reliabilities). 



V. Conclusion and outlook 



Appendix I 



Proof of eq. (fT3l) 



A k , n is given by 




(32) 



The fundamental equality between binomials 




(33) 
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leads to 

A k , n +i = {l-p) A k>n + p A k _^ n . (34) 



Setting Q k {z) = ^ A k, n z n implies 



n=0 

-Q h {z) = {l-p)Q k {z)+pg k -i{z). (35) 

so that 

Since A 0>n = l,Vn, Qo(z) = 1/(1 — z); eq. (fl"3l follows. 
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